Model Selection

Multimodal diffusion model

# Multimodal diffusion model

Cosmos Predict2 2B Video2World

Cosmos-Predict2 is a high-performance pre-trained world foundation model designed for physical AI development, capable of generating physics-aware images, videos, and world states.

Cosmos Predict2 14B Text2Image

Cosmos-Predict2 is a series of high-performance pre-trained world foundation models designed for physical AI to generate physics-aware images, videos, and world states.

Cosmos Predict2 2B Text2Image

Cosmos-Predict2 is a series of high-performance pre-trained world foundation models designed to generate physics-aware images, videos, and world states, which can be used for the development of physics AI.

Gligen Inpainting Text Image

GLIGEN is a diffusion-based grounded text-to-image generation model capable of generating realistic images from text prompts, bounding boxes, and reference images.

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase